Claire Cardie

Cornell University

Knowing but Not Showing: LLMs Recognize Ambiguity but Rarely Ask Clarifying Questions

May 24, 2026

Token-weighted Direct Preference Optimization with Attention

May 21, 2026

How Far Are We From True Auto-Research?

May 18, 2026

Bootstrapping Post-training Signals for Open-ended Tasks via Rubric-based Self-play on Pre-training Text

Apr 21, 2026

FIRE-Bench: Evaluating Agents on the Rediscovery of Scientific Insights

Feb 02, 2026

MovieRecapsQA: A Multimodal Open-Ended Video Question-Answering Benchmark

Jan 05, 2026

Better LLM Reasoning via Dual-Play

Nov 19, 2025

Thinking Fast and Right: Balancing Accuracy and Reasoning Length with Adaptive Rewards

May 23, 2025

HAPO: Training Language Models to Reason Concisely via History-Aware Policy Optimization

May 16, 2025

Between Underthinking and Overthinking: An Empirical Study of Reasoning Length and correctness in LLMs

Apr 30, 2025